Analyzing symmetry breaking within a chaotic quantum system via Bayesian inference 
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Bayesian inference is applied to the level fluctuations of two coupled microwave billiards in order 
to extract the coupling strength. The coupled resonators provide a model of a chaotic quantum 
system containing two coupled symmetry classes of levels. The number variance is used to quantify 
the level fluctuations as a function of the coupling and to construct the conditional probability 
distribution of the data. The prior distribution of the coupling parameter is obtained from an 
invariance argument on the entropy of the posterior distribution. 
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I. INTRODUCTION 

The subject of the present paper is Bayesian inference 
as appUed to the experiment of Ref. B in order to extract 
the mean square matrix element coupling two chaotic 
classes of quantum states. The Bayesian procedure de- 
scribed below does not contain any arbitrary element: 
The prior distribution — sometimes left to the educated 
guess of the analyst H — is determined by an invariance 
argument on the entropy of the posterior distribution. 

The present article is organized as follows. In Sec. ||, 
we briefly describe the experiment with superconducting 
microwave resonators that has provided the data for the 
present analysis. The random matrix model for the cou- 
pling of two symmetry classes of chaotic states is defined 
in Sec. 



[II 



A 



It yields — in analytic form — the depen- 
dence of the observable on the coupling strength which is 
to be determined. Bayesian inference, especially the def- 
inition of the prior distribution, is discussed in Sec. IV. 
The conditional probability distribution of the data is 
defined in Sec. p . Th e results are given in Sec. VI 
discussion in Sec. |VIl| concludes the paper. 



II. THE EXPERIMENT WITH COUPLED 
MICROWAVE RESONATORS 

Billiards provide models of classical and quantum me- 
chanical chaos. They have been studied extensively, see 
the review article pi . Quantum mechanical billiards can 
be simulated by flat microwave resonators iH-R]. One 
class of these "quantum" billiards are the Bunimovich 
stadium billiards S experimentally investigated in Refs. 

These investigations show that the fluctuation prop- 
erties of the quantum chaotic systems with well defined 
symmetries are described by Dyson's matrix ensembles 
1^. In the case of the stadium billiards, the correct de- 
scription is provided by the Gaussian Orthogonal En- 
semble (GOE). This means, e.g., that the fluctuations 
of the positions of the eigenmodes — shortly the level 
fluctuations — are the same as the fluctuations of the 



eigenvalues of random matrices drawn from the GOE. In 
order to assess these fluctuations, various statistics have 
been defined — such as the distance of neighboring levels 
or the variance of the number of levels in a given inter- 
val. The expectation values of these statistics have been 
worked out [ [l3|Jl4[| for comparison with data such as the 
present ones. 

In the previous work |l| , the level positions of a system 
have been measured that consisted of two (quarters of) 
stadium billiards coupled electromagnetically. See Fig. [|. 
The technical realization of the coupling has been de- 
scribed in Ref. H]. In the frequency range of to 16 
GHz, the complete spectra of the two stadia displayed 
608 and 883 resonances in the (7=1) stadium and the 
(7 — 1.8) stadium, respectively. The mean level spacing 
is D = 10.7 MHz. 

In Fig. 0, small pieces of spectra are shown for three 
different couplings. The arrows shall help to recognize 
that — due to the couphng — the resonances are shifted 
by statistically varying amounts. 

This system simulates two symmetry classes of levels 
coupled by a symmetry breaking interaction. Each class 
of levels — represented by each of the uncoupled stadia 
— can be identified with a chaotic system of well defined 
symmetry having the properties of the GOE. The entire 
system of the coupled stadia no longer has the universal 
properties of the GOE. Its properties are a function of a 
suitably defined coupling parameter A. 
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1. Shapes of the two coupled stadium billiards. The 



circles inside the squares indicate the positions of the anten- 
nas used to scatter microwave power through the system in 
order to find the eigenfrequencies of the entire system. The 
parameter 7 is the ratio between the length of the rectangular 
part and the radius of the circular part of the resonator. The 
vertical heights of the stadia are given in Sec. [Vl|. 
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FIG. 2. Three spectra — within a small range of frequen- 
cies — taken at different couplings between the two res- 
onators. The coupling increases from top to bottom. Its 
notation (x,y) is explained in Sec. [VI| . The arrows shall help 
to recognize the shifts of a few resonances. 

The investigation of symmetry breaking in chaotic 
quantum systems is not a recent challenge to physicists 
|l5| . Good examples of the experimental and theoretical 
efforts already invested into this problem, are the cases 
of isospin mixing |16-Iq], of parity violation in heavy 
nuclei |]19[ , of the breaking of certain atomic and molecu- 
lar symmetries |lq,pQ]. The experiment performed in R] 
provides a general model for these case studies. 

In the present paper, we do not describe any one of 
the specific case studies; we shall not even describe in 
more detail the model experiment of Ref . |1|] . We rather 
describe — in the next section — the model experiment 
[y in an abstract mathematical form and then turn to 
its analysis in Sees. pv[-|VII|. 



III. THE MATHEMATICAL MODEL OF 

SYMMETRY BREAKING IN A CHAOTIC 

QUANTUM SYSTEM 

In the absence of coupling each eigenstate of the system 
of Fig. Q can be characterized as belonging to either res- 
onator 1 or resonator 2. This is equivalent to the assign- 
ment of a quantum number 7. The spectrum of states of 
each 7 has the statistical properties of the eigenvalues of 
matrices drawn from the GOE. The superposition of the 
two spectra displays what we shall call 2 GOE behavior. 
It can be described by a block-diagonal Hamilton oper- 



ator where each block is an element of the GOE, hence, 
by the first term of the Hamiltonian 



n = 




(1) 



For a 7^ 0, the off-diagonal matrix V in the second term 
on the r.h.s. provides the coupling between both sym- 
metry classes. It has Gaussian random elements — as 
the GOE blocks. If the two GOE blocks have the same 
dimension then their elements as well as the elements of 
V shall all have the same rms value. Then a = 1 turns 
Ti as a whole into a GOE matrix p^]. The resulting 
spectrum displays what we call 1 GOE behavior. If the 
two GOE blocks have different dimensions, then the rms 
values must be chosen such that their spectra have the 
same length. The details are given in |]1^. This model 
is a special case of the model by Rosenzweig and Porter 

The parameter that governs the level statistics is av/D 
rather than a. Here, D is the mean level distance of H. 
See Refs. lljJlq]. In the sequel, the coupling parameter 



/av\'- 



A=(ij (2) 

will be used. Often the coupling strength is also 
parametrized in terms of the spreading width 



r^ = 27r 



D 

: 2nAD. 



(3) 



The statistic used in the present paper in order to 
characterize the behavior of the data, is the so-called S^ 
statistic or number variance. It is the variance z{L) of 
the number n{L) of levels found in an interval of length 
LD, i.e. 



z{L) = {iniL)^Lf). 



(4) 



Here, the angular brackets () denote the average over 
all pieces of spectra of length L that have been cut out 
of the entire experimental spectrum. The procedure is 

described in Sec. ^ 

The expectation value z{L) with respect to the statis- 
tical ensemble defined by Eq. (|l|) is called S]^(£, A). This 
function has been calculated by French et al. [|l3[ and by 
Leitner et al. [|l^. According to Q, it is 

S2(L,A) 



ziL) 



1 



= S^(L,cx)) + — ln( 1 
Here, I]^(L, 00) is the expression 



4(T + 7r2A)2 



(5) 



S^(L, cx)) = ^ <^ ln(27rL) +7^ + 1 



2 m7rL)Y 



-fSi(vrL) 



+ Tr^L 



1 



cos(27rL) - Ci(27ri) 



-Si(27rL) 



(6) 



It describes the 1 GOE behavior. The second term on 
the r.h.s. of Eq. ^ obviously vanishes for A ^- oo. 

In Eq. (y), "fE is Euler's constant and Si, Ci are the 
sine and cosine integrals defined e.g. in paragraph 8.23 of 
p2[ . The parameter r is a function of the ratio between 
the dimensions of the two GOE blocks in the first term 
of Eq. (||). In the present situation, it is equal to 0.74. 

The function S^ depends on the coupling parameter 
A — as is illustrated by Fig. 0. Therefore A can be in- 
ferred from the experimental number variance z{L) . The 
principle of this inference is described in the next section. 
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FIG. 3. The expectation value E^(L, A) of the number vari- 
ance z(V) as a function of L for various coupling strengths A 
— ranging from 2 GOE to 1 GOE behavior. The abscissa has 
been limited to L < 5 because the data analyzed below are in 
the range of 1 < L < 5. 



IV. BAYESIAN INFERENCE 

Suppose that a set of experimental data z^, k = 1...M, 
is given which depends on a parameter A in the sense 
that the probability distribution Wk of the event Zk is 
conditioned by the hypothesis A, 



Wk = Wk{zk I A). 



(7) 



The events z^ shall be statistically independent of each 
other. The joint distribution W of the Zk, k = 1...M, is 
then 



M 



Wiz\A) = l[wkizk\A). 



(8) 



fc=i 



From this follows the distribution P(A | z) of A under the 
condition that the data z are given via Bayes' theorem 



P(A I z) = 



W{z\A)^,{A) 
m{z) 



(9) 



Here, /i(A) is the so-called prior distribution. It is the 
measure of integration in the space of A. One must de- 
fine it such that it represents ignorance on A — in a sense 
described below. The function m(z) is the prior distri- 
bution of z. It is not independent of fj,; it is given by the 
normalizing integral 



m(z) = / dA W{z I A)^(A). 



(10) 



In the framework of the logic underlying Eq. (O), a prob- 
ability distribution of — say — A is considered to rep- 
resent the available knowledge on A and the prior distri- 
bution corresponds to "ignorance about A" . 

The definition of /i(A) deserves a detailed comment. 
First of all, the natural choice of /i(A) is not the con- 
stant function because a reparametrization A -^ ^(A) 
will transform /i(A) into 



Mt(A) = fi{A) 



dA 
dA 



(11) 



Unless the transformation is linear, it turns a uniform 
distribution into a non-uniform one. 

We define /i(A) such that the entropy of P{A\z) does 
not depend on the true value A that governs the distri- 
bution of the data z — {zi...zm)- The data follow the 
distribution IF(z|A). Although A is not known, it is sup- 
posed to be a well defined number. If it is shifted to 
another value A' and one takes new data z' and con- 
structs the posterior distribution P' = P(A|z') from the 
new data, then one can expect P' to be shifted with re- 
spect to P. The distribution P' will be centered in the 
vicinity of A' rather than A. However, we want to make 
sure that the "spread" of P' is the same as that of P; 
that is, the entropy of P and P' shall be the same — for 
a given number of data M. In this sense, no value of A 
is a priori preferred over any other one. 

The definition of the entropy requires some attention. 
The usual formula — J dA P In P for the entropy is of 
too restricted validity in the present context, because 
this expression is not invariant under a reparametriza- 
tion A -^ A(A). The general expression for the entropy 
is 



H 



dAP{A\z) In 



P(A|^) 

m(A) 



(12) 



which is independent of a reparametrization |£3|,g4| , be- 
cause the transformations of both distributions, P and /i, 
are performed according to (pT|). Therefore the deriva- 
tive jdA/dAj drops out of the argument of the logarithm 
and expression (112) is left unchanged by the substitution 
A^ A. 



It is possible to define fj, such that H is independent of 
the true value A, if W possesses the property introduced 
in p4|-pq| called form invariance. It states that there is a 
group of transformations Qp such that the simultaneous 
transformation of z and A leaves W invariant, i.e. 



WiGpzlGpA) dGpZ = W{z\A) dz. 



(13) 



The group parameter p must have the same domain of 
definition as the hypothesis A. If one chooses /x(A) to be 
the invariant measure of the group then it is not difhcult 
to show that the posterior distribution P also possesses 
the invariance (113). This entails that H is invariant un- 
der any transformation z — > QpZ of the data. However, 
by Eq. (03) this is just what happens to a given data set 



if the true value A is shifted to A' = Qp^K. 

There is a handy formula that yields the invariant mea- 
sure without any study of the structure of the group. It 



M(A) 



M- 



d^''zW{z I K)—^\nW{z \ A) 



1/2 



(14) 



even before form in- 



and was proposed by Jeffreys 
variance was discussed. 

Not every conditional distribution W possesses a sym- 
metry (|l3). Even if this is not the case, expression (|lj) 
ensures that H is approximately independent of the true 
value of A. This holds in the following sense: For every A 
one can replace the correct distribution W{z \ A) by an 
approximation Wapp{z | A) which is form invariant. The 
approximate and the correct distributions agree within 
the fourth order of A — A. Equation (|4|) yields the in- 
variant measure of the approximation to within the third 

order of A - A |28). 

ensures that no value of 



In summary: expression (h4h 
A is a priori preferred over any other one if the form 
invarianc e (|l3| ) exists. If there is no form invariance, ex- 
pression (|14|) approximately ensures this. Therefore (llj) 
is the best recommendation in any case. 

Neither the group theoretic argument nor Jeffreys' rule 
nor information theoretic arguments are new in the dis- 
cussion of the Bayesian prior. However, the way in which 
they are related justifies the present digression on a fun- 
damental issue. We omit to show how and why the 
present arguments are related to the geometric consid- 
erations which were introduced by Amari p3 and are 
currently put forward by Rodriguez [Q. These authors 
agree on the result ([l4|). 

The posterior distribution P is used to construct an 
interval of error often called a confidence interval. It is 
the shortest interval that contains A with probability K. 
The usual error is defined with the confidence K = 0.68. 

The posterior distribution P approaches a Gaussian 
for M ^- cx) provided that the true value of A is not on 
the border of the domain of definition of A. One can 
prove that the variance of the Gaussian is proportional 
to M~^. Hence, with increasing M the posterior distri- 
bution P will become so narrow that /i changes very little 



in the domain where P is essentially different from zero. 
Note that p. does not depend on M. Then /i drops out 
of expression (O). If this happens, the present Bayesian 
analysis becomes equivalent to a x^ fit of S^(i, A) to the 
experimental points z{L). The standard procedure of the 
X^ fit can e.g. be found in [pl| . It does not require a prior 
distribution. 

If P is not Gaussian, the y^ fit yields meaningless con- 
fidence intervals. Then Bayesian inference cannot be by- 
passed. In the example presented below this happens 
in the limit of small coupling between the resonators: 
Eventually, the posterior distribution P decreases mono- 
tonically. The experiment is then compatible with zero 
coupling because the shortest confidence interval contains 
the point A = for any K . The point of zero coupling is 
on the border of the domain of definition of A. 



V. THE DISTRIBUTION OF THE DATA 

Spectral fluctuation properties can only be studied af- 
ter secular variations of the level density have been re- 
moved, i.e. after the frequency scale has been transformed 
such that the level density becomes unity within the in- 
terval covered by the experiment. This procedure — of- 
ten called "unfolding" the spectrum — is a standard one 
|p2| and has been applied. 

After this we defined — for a given interval of length L 
— Nl adjacent intervals. The intervals did not overlap 
and no space was left in between them. This means 



Nl^ 



length of spectrum 



(15) 



where the square brackets designate the largest integer 
contained in the fraction. For each interval, the num- 
ber n{L) of levels occuring within it was counted and 
the squared difference {n{L) — L)"^ was averaged over the 
Nl intervals. This defines the average () introduced in 
Eq. (Q) and, hence, the "event" z{L). This procedure was 
repeated for a set of values Lfc, k = 1...M, to be defined 
below. In this way, M events 



Zk = z{Lk) 



(16) 



were obtained. 

The Bayesian procedure outlined in the previous sec- 
tion requires that one assigns a probability distribution 
Wk{zk I A) to each event. The Zk are statistical quanti- 
ties in the following sense: If another spectrum would be 
provided that had the same statistical properties as the 
measured one and the data Zk would be constructed in 
the same way as above, they would of course not coincide 
with the data obtained from the actually measured spec- 
trum — precisely because the levels are subject to statis- 
tical fluctuations. If one could go through the ensemble 
of spectra in this way, one would obtain an ensemble of 



data Zk- We are looking for the distribution Wk of this en- 
semble. Since there is only the single measured spectrum 
and since no theory yielding Wk is available, we have gen- 
erated the distribution of Zk by Efron's bootstrap method 
p3[ . This method generates the distribution numerically 
by drawing at random and independently a new set of 
Nl^ intervals from the Nl^. original intervals. A new Zk 
is produced from this new set of intervals. Repeating 
this many times, a distribution of Zk is generated which 
is identified with the distribution Wk of the Zk- Note that 
Nl^ is always large, namely N^^^ > 300. 

For Lk > 1, the distribution Wk was in this way found 
to be a x^ distribution with Nl^ degrees of freedom — 
which intuitively seems reasonable. They are close to 
Gaussians with variance 2/Nl^. As mentioned above in 
Sec. [IL the mean value of this distribution is 



Zk = dz zwk{z I A) 
= Y?{Lk,h). 

x Eq. §) — 



(17) 



the distribution 



Since S^ depends on A — 
Wk depends on A. 

We have restricted the analysis to L^ > 1. In the do- 
main of Lfe < 1 , the number variance so weakly depends 
on the parameter A that one does not give away much 
information by this restriction. 

In order to avoid an unnecessarily complicated distri- 
bution of the Zk, we want to be sure that there are no 
correlations between Zk and Zk>, for k ^ k' . It was there- 
fore necessary to determine the minimum e of the dis- 
tance I Lfc — Lk' I that would still allow for statistically 
independent z^, Zk' . Indeed if | L^ — Lk' \ is very small 
then most of the intervals associated with Lk will almost 
coincide with an interval associated with Lk' ■ As a conse- 
quence, many of the numbers n{Lk) found in the intervals 
associated with Lk will occur also in the intervals asso- 
ciated with Lk'. Hence, Zk will not be independent from 
Zk' ■ In order to determine e, we have calculated z{L) as 
a function of L in steps of 0.001. For a small range of 
L, the result is given in Fig. 0. Indeed over a distance 
of a few times this step width, z{L) changes little. If 
\ L — L' \ is many times this step width, then z{L) and 
z{L') show independent fluctuations. In principle, one 
can study the decay of the correlations as a function of 
I i — L' I by constructing the autocorrelation function of 
z{L). We have contented ourselves to inspect Fig. Q and 
similar plots for different domains of L. It seems obvious 
from Fig. ||that the typical width of the structures is less 
than 0.025. This justifies to set 



to define 



e = 0.025, 



l + (fc-l)e, k^l,2,...M 



(18) 



(19) 




4.90 



4.92 



4.94 



4.96 



4.98 



5.00 



and to assume that Zk is statistically independent of Zk' 
for k ^ k'. 



FIG. 4. The experimental estimate z{L) of the number 
variance — see Eq. (W) — as a function of L calculated in 
steps of AL = 0.001. The typical width of the structures in 
this fluctuating function determines the range e over which 
z{L) and z{L + e) may be correlated. Conversely, it serves 
to estimate the minimum distance e between L and L' which 
must be respected if z{L) shall not be correlated with z{L'). 

There is an upper limit Lmax of L that one must be 
aware of: The spectral fluctuations of levels from billiards 
agree with those of random matrices — i.e. they are uni- 
versal — within intervals of a maximum length which is 
inversely proportional to the length of the shortest peri- 
odic orbit in the billiard ||34| , p5|| . This requires Lmax = 5 
here. 

Hence, data z{L) for 1 < L < 5 were used to obtain the 
results presented below. This means that by Eqs. (18 1^) 
the number of statistically independent data points is 
M ^ 161. 

Let us note that one can devise definitions of the set of 
intervals with given length L other than adjacent inter- 
vals — as was done here. One can admit a certain overlap 
between them as suggested in |Q or one can place them 
at random |3^. We have tried these alternatives and 
have made sure that they do not significantly change the 
results presented below. 



VI. RESULTS 

The data z{Lk), k — 1, ..., 161, are given on Fig. for 
the six different couplings that were experimentally in- 
vestigated. The coupling strength increases from top to 
bottom on Fig. pi Its experimental realization is indi- 
cated by the two numbers (xi, X2) in brackets that label 
the six parts of the figure. They are explained as follows: 



The billiards were positioned with their flat sides against 
each other. Holes were drilled through the walls of the 
resonators such that a niobium pin could be inserted per- 
pendicularly to the plane of the billiards through the 
(7 = 1.8) stadium into the (7 — 1) stadium. The cou- 
pling strength is determined by the depths x\ and X2 by 
which the niobium pin penetrates into the (7 = 1) and 
the (7 — 1.8) stadium, respectively. These depths are 
given by {xx^x-i) in mm. The net height of the (7 = 1) 
stadium was 7 mm and that of the (7 = 1.8) stadium 
was 8 mm. For the strongest coupling — i.e. the bottom 
part of the figure — a second niobium pin, penetrating 
all the way through both resonators, was added. The 
coupHng (0,8) — i.e. the top part of the figure — is the 
case, where the billiards should be decoupled. 




FIG. 5. The number variance z(V) (crosses) and its ex- 
pectation value E^(L, A) (full lines) for various experimental 
couplings (x\,X2)- The dashed lines display the 2 GOE behav- 
ior, i.e. E^(L,0). The dotted lines give the 1 GOE behavior, 
i.e. Y?(L,oo). 

The dashed lines on Fig. illustrate the limiting case of 
2 GOE behavior, i.e. expression (^ with A = 0. The dot- 
ted lines show the limit of 1 GOE behavior, i.e. expression 
(O). Obviously, all six cases are not easily distinguished 
from the 2 GOE behavior, i.e. A = 0. 



Prior to the analysis it was therefore not clear whether 
the six experimental cases would yield distinguishable 
coupling parameters A and whether these would even be 
distinguishable from zero. The latter question means ac- 
cording to Sec. 1^: It was not clear whether a y^ fit would 
be appropriate. Therefore the whole analysis was based 
on Bayesian inference. The prior distribution was calcu- 
lated from (|lj). The probability distribution i(jfe(zfe|A) 
of the data has been defined in Sec. M. Whether form 
invariance exists has not been investigated. The scat- 
ter of the data is quite large — especially for L close 
to 5. These fluctuations are assessed by the distribution 
Wk{zk I A). The fluctuations increase with increasing L. 
This is reflected by the fact that Wk was found to be a 
X^ distribution with N^^ degrees of freedom. Its rela- 
tive rms deviation is ^J^JW^l and iV^^^ decreases with 
increasing Lfc, see Eq. (15). Despite the scatter of the 



data the coupling parameter is so well determined that 
the analysis distinguishes the six experimental cases from 
each other — because the number of data points is large 
enough. 

For all cases except coupling (0,8), the posterior distri- 
bution (£) turned out to be Gaussian. This is illustrated 
on Fig. 3 for the coupling (5,3). In the case of couphng 
(0,8) — which is expected to show 2 GOE behavior — 
the posterior distribution of A is the monotonically de- 
creasing function of Fig. |7[ This is reasonable because 
the shortest confidence interval on A will — for any con- 
fidence — include the possibility of A = 0. Hence, the 
distribution of Fig. ^ allows to state only an upper limit 
for A. 
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FIG. 6. Posterior distribution for the coupling (5,3), i.e. the 
second case on Fig. ^a and in Table |. The center Ao and the 
rms deviation of the Gaussian are specified. 



TABLE I. Parameters found for six different experimental couplings. The results have been obtained via Bayesian inference 
as outlined in Sec. |V|. The column headings are explained in the text. 



Exp. coupling 


A 


V^/D 


av/D 


av 


x' 


Afit 


(0,8) 


< 0.00085 


< 0.005 


< 0.029 


<0.31 


1.14 




(5,3) 


0.011 ±0.002 


0.07 ±0.01 


0.105 ±0.008 


1.12 ±0.09 


0.90 


0.013 ± 0.002 


(4,4) 


0.017 ±0.002 


0.11 ±0.01 


0.130 ±0.007 


1.39 ±0.07 


1.04 


0.019 ± 0.002 


(5,8) 


0.030 ± 0.002 


0.19 ±0.01 


0.173 ±0.006 


1.85 ±0.06 


1.11 


0.033 ± 0.002 


(6,8) 


0.032 ± 0.002 


0.20 ±0.01 


0.180 ±0.006 


1.93 ±0.06 


1.47 


0.037 ± 0.003 


(6,8)+(7,8) 


0.040 ± 0.002 


0.25 ±0.01 


0.200 ± 0.006 


2.14 ±0.06 


1.24 


0.044 ± 0.002 



The results of the Bayesian analysis are summarized in 
the first five columns of Table L The first column char- 
acterizes the experimental realization of the coupling as 
explained above. In the second column, the coupling pa- 
rameter A is given. It can also be expressed (in the third 
column) by the ratio T^/D, see Eq. (^). Alternatively 
— see Eq. (g) — the combination av/D of parameters 
of the model of Eq. dh) is given in the fourth column. 
By putting D equal to the mean level distance D — 10.7 
MHz of the experiment, one obtains in the fifth column 
the rms coupling matrix element av in MHz. 

In the case of coupling (0,8), where only an upper limit 
for the coupling can be given, we have done so — for the 
confidence of 68%. In all other cases the center Aq of the 
Gaussian posterior is given together with the rms devia- 
tion; this defines a 68% confidence interval. 
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FIG. 7. Posterior distribution for the coupling (0,8), i.e. the 
case where the coupling is expected to be zero. The probabil- 
ity integrated from A = to 0.00085 (marked by an arrow) is 

68% . 



According to Sec. IV, Gaussian posteriors suggest that 
one may replace Bayesian inference by a x^ fit which 
is simpler. A x^ fit has been performed in all cases 
and the results are given in the last two columns of 



the table. The sixth column displays the normalized x^ 
value. For a reasonable fit, it should lie between 0.84 
and 1.16. This follows from the fact that the distribu- 
tion of x^ is here approximately Gaussian with rms value 
(2/(number of degrees of freedom))i/2 = (2/160)1/2 ~ 

0.16. The seventh column gives the coupling parameters 
A which the fit has found. They are compatible with the 
Bayesian results except for the first entry (0,8). Here the 
fit puts out a negative value, i.e. it does not produce a 
meaningful result. This was expected from the discussion 



in Sec. [V. 



VII. DISCUSSION 

The emphasis of the present paper is on the Bayesian 
analysis of the data. Although Bayes' theorem provides a 
clear and simple prescription of how to draw conclusions 
from data about a hypothesis conditioning the data, its 
use was hampered for a long time by the difficulty to de- 
fine the prior distribution /i of the hypothesis. Equation 
(|l4| ) is a very general definition of /i. It applies even to 
cases, where the variable z of the events is discrete (the 
integral in (nj) then means a sum). The prior distri- 
bution (y_^ ensures that the amount of information one 
gets on the hypothesis A is — at least approximately — 
independent of the true value of A. 

Supplemented by Eq. (|4|), Bayes' theorem provides 
the generalization of all methods of inference that rely 
on Gaussian approximations. The method of the least 
squares e.g. belongs to them. It does not require a prior 
distribution of the parameter to be determined. In the 
present paper the relation between Bayesian inference 
and x^ fit has been discussed. A criterion has been given 
under which Bayesian inference is approximately equiva- 
lent to the simpler fit procedure. This criterion has been 
substantiated numerically. 

The present formalism especially provides the correct 
treatment of null-experiments, i.e. of experiments that 
yield only an upper limit for the parameter of interest. 
An example for this situation has been presented. By 
the same token, the formalism of Sec. IV provides the 



decision whether the parameter is compatible with zero. 
The physical results of the present analysis show that 
the strongest coupling realized in the microwave experi- 
ment IT] has about the same size as the coupling found 
in [l^l to occur between states of different isospin in 



^^Al. The strongest coupling treated in the present pa- 
per causes about 25% mixing between the two classes of 
levels, i.e. a state which can be approximately assigned 
to the (7 = 1) stadium contains about 25% strength from 
the configurations of the (7 — 1.8) stadium — and vice 
versa. This is the interpretation of the value of T^ /D in 
Table I. Data that are as numerous and precise as those 
of Ref. ||l| allow to detect T^/D ten times smaller than 
the result of [Q — according to the present analysis. 
Nuclear data — which never provide as large a sample 
of states as the experiment |^ — would not allow to de- 
tect r^/D = 0.07 (the smallest detected mixing in Table 
I) from the level fluctuations. The precision obtained in 
this experiment has allowed to detect the subtle depen- 
dence of the level fluctuations on the breaking of a sym- 
metry which is predicted by the random matrix model 
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